Clustering gene expression profile data by selective shrinkage

Authors

  • Hemant Ishwaran
  • J. Sunil Rao
Abstract

Clustering of gene expression profiles is a widely used approach for finding macroscopic data structure. A complication in such analyses is that not all genes are informative for forming clusters and different clusters might have different transcription regulation. Driven by these considerations, we present a novel two-stage clustering approach. The first stage identifies informative genes by adaptive variable selection using pseudo-samples modeled by a high dimensional multigroup ANOVA model. Variables are selected using a rescaled spike and slab Bayesian hierarchical model having a special selective shrinkage property. The second stage uses output from the first stage for clustering. We demonstrate why selective shrinkage occurs, and by extension, why it is useful for the clustering paradigm. We analyze a human gene atlas expression dataset where the question of interest is to look for tissue-specific transcription regulation and investigate whether tissues can be grouped together due to similar genomic control. © 2008 Elsevier B.V. All rights reserved.

Related articles

Modification of the Fast Global K-means Using a Fuzzy Relation with Application in Microarray Data Analysis

Recognizing genes with distinctive expression levels can help in prevention, diagnosis and treatment of the diseases at the genomic level. In this paper, fast Global k-means (fast GKM) is developed for clustering the gene expression datasets. Fast GKM is a significant improvement of the k-means clustering method. It is an incremental clustering method which starts with one cluster. Iteratively ...

Clustering Gene Expression Data Using Random Forest Dissimilarity

Background: The clustering of gene expression data plays an important role in the diagnosis and treatment of cancer. These kinds of data typically involve a large number of variables (genes) compared with the number of samples (patients). Many clustering methods have been built on the dissimilarity among observations, which is calculated by a distance function. As increa...

Evaluation of β-actin as a Reference Gene for Comparative Expression Analysis of Equine Adipose- and Bone Marrow-Derived Mesenchymal Stem Cells by qRT-PCR

Background: Bone marrow and adipose tissue are two main sources of mesenchymal stem cells (MSCs). Some studies suggest that there are differences in the gene expression profiles of MSCs derived from various tissues. To investigate gene expression profiles by qRT-PCR, an appropriate reference gene with a stable expression level should be chosen for normalizing data. This study was designed to e...

Gene Expression Profile Analysis during Mouse Tooth Development

Introduction: Complex molecular pathways are involved in the development of different tissues such as teeth. Differential gene expression patterns during tooth development generate different tooth types. Tooth development results from interactions between the oral epithelium and underlying ectomesenchyme cells of neural crest origin. Tooth development is regulated by different signaling networks. In thi...

EFFECT OF AEROBIC TRAINING AND ETHANOL CONSUMPTION ON LIPID PROFILE AND GENE EXPRESSION OF SOME GASTROCNEMIUS MUSCLE MYOKINES IN MALE RATS

Background: Skeletal muscle, as an endocrine tissue, is involved in the regulation of metabolic activity and in the production and secretion of hormones, including myokines. The aim of the present study was to investigate the effect of eight weeks of aerobic training combined with ethanol consumption on plasma lipid profile and glucose levels, triglyceride content and myonectin, irisin and leptin gene expr...

Publication date: 2008